Corpus: mwl_wikipedia_2014_10K

2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character N-grams for N = 1 to 5, together with a Zipf diagram of their rank-frequency distribution.
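
The counting described above can be sketched as follows. This is a minimal illustration, not the tool that produced this report: the function name `top_word_beginnings`, the parameter `k`, and the rule that a word must be longer than N characters to contribute a beginning (suggested by the trailing hyphen in entries such as `cun-`) are all assumptions. Whether the report counts word types or running tokens is not stated, so the sketch simply counts whatever list it is given.

```python
from collections import Counter


def top_word_beginnings(words, n, k=5):
    """Return the k most frequent n-character word beginnings.

    Assumption: only words strictly longer than n characters are counted,
    so that every prefix is a true beginning followed by more characters.
    """
    counts = Counter(w[:n] for w in words if len(w) > n)
    # most_common sorts by descending count
    return counts.most_common(k)


if __name__ == "__main__":
    # Hypothetical sample input, not taken from the corpus
    sample = ["cuntra", "cunta", "cuntar", "cumprar", "antes", "anterior", "cun"]
    for rank, (prefix, freq) in enumerate(top_word_beginnings(sample, 3), start=1):
        print(rank, freq, prefix + "-")
```

Running the same function for n = 1 through 5 yields one ranked list per N-gram length, in the same rank, frequency, n-gram layout as the tables in this section.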

[Figure: Zipf's diagram for word beginnings (Gnuplot)]

Top Characters
rank frequency n-gram
1 3371 a-
2 2636 c-
3 2198 p-
4 2118 s-
5 1521 d-

Top Character Bigrams
rank frequency n-gram
1 1064 re-
2 1048 an-
3 1010 cu-
4 709 pr-
5 637 de-

Top Character Trigrams
rank frequency n-gram
1 579 cun-
2 304 ant-
3 300 pro-
4 272 cum-
5 266 per-

Top Character 4-Grams
rank frequency n-gram
1 181 cump-
2 179 cunt-
3 162 ante-
4 158 cuns-
5 103 qu'a-

Top Character 5-Grams
rank frequency n-gram
1 87 anter-
2 85 cuntr-
3 57 cunsi-
4 55 cumpr-
5 54 cunse-
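
A Zipf diagram plots frequency against rank on logarithmic axes, and a distribution that follows Zipf's law appears as a roughly straight line with a slope near -1. The sketch below shows one way to derive the plotted points and a least-squares slope from a list of frequencies. The function names `zipf_points` and `zipf_slope` are hypothetical, and the report does not say how its own diagram was fitted or drawn beyond naming Gnuplot.

```python
import math


def zipf_points(frequencies):
    """Return (log10 rank, log10 frequency) pairs, ranked by descending frequency."""
    ordered = sorted((f for f in frequencies if f > 0), reverse=True)
    return [(math.log10(rank), math.log10(freq))
            for rank, freq in enumerate(ordered, start=1)]


def zipf_slope(frequencies):
    """Ordinary least-squares slope of log frequency against log rank.

    Needs at least two positive frequencies; a slope close to -1 matches
    the classic form of Zipf's law.
    """
    points = zipf_points(frequencies)
    if len(points) < 2:
        raise ValueError("need at least two positive frequencies")
    mean_x = sum(x for x, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx
```

The five rows listed per table here are too few for a meaningful fit; the slope is only informative when computed over the full ranked list of word beginnings.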
478 msec needed at 2018-01-05 22:09